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TESTING FOR JUMPS IN A DISCRETELY OBSERVED PROCESS 

By Yacine Ait-Sahalia 1 and Jean Jacod 
Princeton University and Universite Pierre et Marie Curie 

We propose a new test to determine whether jumps are present in 
asset returns or other discretely sampled processes. As the sampling 
interval tends to 0, our test statistic converges to 1 if there are jumps, 
and to another deterministic and known value (such as 2) if there 
are no jumps. The test is valid for all Ito semimartingales, depends 
neither on the law of the process nor on the coefficients of the equation 
which it solves, does not require a preliminary estimation of these 
coefficients, and when there are jumps the test is applicable whether 
jumps have finite or infinite-activity and for an arbitrary Blumenthal- 
Getoor index. We finally implement the test on simulations and asset 
returns data. 

1. Introduction. The problem of deciding whether the continuous-time 
process which models an economic or financial time series should have con- 
tinuous paths or exhibit jumps is becoming an increasingly important issue, 
in view of the high-frequency observations that are now widely available. In 
the case where a large jump occurs, a simple glance at the dataset might be 
sufficient to decide this issue. But such large jumps are usually infrequent, 
may not belong to the model itself, can be considered as breakdowns in 
the homogeneity of the model, or may be dealt with separately using other 
methods such as risk management. 

On the other hand, a visual inspection of most such time series in practice 
does not provide clear evidence for either the presence or the absence of 
small or medium sized jumps. Since small frequent jumps should definitely 
be incorporated into the model, and since models with and without jumps 
do have quite different mathematical properties and financial consequences 
(for option hedging, portfolio optimization, etc.), it is important to have 
statistical methods that can shed some light on the issue. 
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Determining whether a process has jumps has been considered by a num- 
ber of authors. A'it-Sahalia (2002) relies on restrictions on the transition 
function of the process that are compatible with continuity of the process, or 
lack thereof, to derive a test for the presence of jumps at any observable fre- 
quency. Using high-frequency data, Carr and Wu (2003) exploit the differen- 
tial behavior of short dated options to test for the presence of jumps. Multi- 
power variations can separate the continuous part of the quadratic variation; 
see Barndorff-Nielsen and Shephard (2006). Andersen, Bollerslev and Diebold 

(2003) and Huang and Tauchen (2006) study financial datasets using mul- 
tipower variations, in order to assess the proportion of quadratic variation 
attributable to jumps. Jiang and Oomen (2005) construct a test motivated 
by the hedging error of a variance swap replication strategy. Other methods 
have been introduced as well [see, e.g., Lee and Mykland (2008)] and the 
literature about evaluating the volatility when there are jumps [see, e.g., 
A'it-Sahalia (2004), A'it-Sahalia and Jacod (2007), Mancini (2001), Mancini 

(2004) and Woerner (2006b)] can also be viewed as an indirect way of check- 
ing for jumps. Woerner (2006a) proposes estimators of the Blumenthal- 
Getoor index or the Hurst exponent of a stochastic process based on high- 
frequency data, and those can be used to detect jumps and their intensity. 

These are two closely related but different issues: one is to decide whether 
jumps are present or not, another one is to determine the impact of jumps 
on the overall variability of the observed process. So far, most of the litera- 
ture has concentrated on the second issue, and it usually assumes a special 
structure on the process, especially regarding its jump part if one is present, 
such as being a compound Poisson, or sometimes a Levy process. We plan 
to have a systematic look at the question of measuring the impact of jumps 
in a future paper, using a variety of distances and doing so under weak as- 
sumptions, and also taking into account the usually nonnegligible market 
microstructure noise. 

In this paper, we concentrate on the first problem, namely whether there 
are jumps or not, while pretending that the underlying process is perfectly 
observed at n discrete times, and n is large. We introduce a direct and very 
simple test which gives a solution to this problem, irrespective of the precise 
structure of the process within the very large class of ltd semimartingales. 

More specifically, a process X = (Xt) on a given time interval [0,T] is 
observed at times iA n for A n = T/n. We propose an easy-to-compute family 
of test statistics, say S n , which converge as A n — > to 1 if there are jumps, 
and to another deterministic and known value (such as 2) if there are no 
jumps. This holds as soon as the process X is an Ito semimartingale, and 
it depends neither on the law of the process nor on the coefficients of the 
equation which it solves, and it does not require any preliminary estimation 
of these coefficients. We provide a central limit theorem for S n , under both 
alternatives (jumps and no jumps); again this does not require any a priori 
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knowledge of the coefficients of the model, and when there are jumps it 
is applicable whether the jumps have finite or infinite activity, and for an 
arbitrary Blumenthal-Getoor index. Hence we can construct tests with a 
given level of significance asymptotically and which are fully nonparametric. 

The paper is organized as follows. Sections 2 and 3 describe our setup and 
the statistical problem. We provide central limit theorems for our proposed 
statistics in Section 4, and use them to construct the actual tests in Section 
5. We present the results of Monte Carlo simulations in Section 6, and in 
Section 7 examine the empirical distribution of the test statistic over all 
2005 transactions of the Dow Jones stocks. Proofs are in Section 8. 

2. Setting and assumptions. The underlying process X which we ob- 
serve at discrete times is a one-dimensional process which we specify below. 
Observe that taking a one-dimensional process is not a restriction in our con- 
text since, if it were multidimensional, a jump would necessarily be a jump 
for at least one of its components, so any test for jumps can be performed 
separately on each of its components. 

As already mentioned, we do not want to make any specific model as- 
sumption on X, such as assuming some parametric family of models. We 
do need, however, a mild structural assumption which is satisfied in all 
continuous-time models used in finance, at least as long as one wants to 
rule out arbitrage opportunities. In any case, in the absence of some kind 
of assumption, anything can happen from a continuous process which is a 
linear interpolation between the observations, to a pure jump process which 
is constant between successive observation times. 

Our structural assumption is that X is an ltd semimartingale on some 
filtered space (O, J 7 , (J^) t >o,P), which means it can be written as 



where W and \x are a Wiener process and a Poisson random measure on x 
E with (E,£) an auxiliary measurable space on the space (O, {Ft)t>o^) 
and the predictable compensator (or intensity measure) of ji is v(ds,dx) = 
ds0X(dx) for some given finite or cr-finite measure on (E,£). Moreover k is a 
continuous function with compact support and k(x) = x on a neighborhood 
of 0, and k'(x) = x — k(x). 

Above, the function k is arbitrary; a change of this function amounts to a 
change in the drift coefficient bt- Also, it is always possible to take (E,£, A) 
to be E equipped with Lebesgue measure. 

Of course the coefficients bt(uj), <?t{oo) and 8(u>,t,x) should be such that 
the various integrals in (1) make sense [see, e.g., Jacod and Shiryaev (2003) 
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for a precise definition of the last two integrals in (1)], and in particular bt 
and at are optional processes and 5 is a predictable function. However, we 
need a bit more than just the minimal integrability assumptions, plus the 
nontrivial fact that the volatility at is also an Ito semimartingale, of the 
form 

a t = a + f b s ds+ f a s dW s + f a' s dW' s 
Jo Jo Jo 

(2) + / / k ° 8(s,x)(n — v)(ds,dx) 

Jo Je 

+ / / k o 5(s, x)n(ds, dx), 
Jo Je 

where W 1 is another Wiener process independent of (W, fi). 

All our additional requirements are expressed in the next assumption, for 
which we need a few additional notations. We write 

(3) AX S = X S - X s _ 

for the jumps of the X process. Of course, AX S = for all s when X is 
continuous, and for all s outside a (random) countable set in all cases. Let 



(4) S' t (co) = 




if the integral makes sense, 
otherwise 



and r = inf (t; AXt ^ 0). In the finite-activity case for jumps we have r > 
a.s., whereas in the infinite-activity case we usually (but not necessarily) 
have r = a.s. Observe that the fifth term in (2) is a.s. equal to — Jq 5' s ds 
for all t in the (possibly empty) set [0, r]; so outside a null set in (uj,t) for 
the product measure ¥(duj) ®dt, the process 5' t (u>) takes finite values for all 
t < t(uj), whereas we may very well have S' t (uj) = oo for all t > t(uj). 

Assumption 1. (a) All paths t^bt{oo) are locally bounded. 

(b) All paths 1 1 — > bt(u>), t at(u>), t \—> a' t {to) are right-continuous with 
left limits. 

(c) All paths 1 1 — > 5(u>, t, x) and t \—> 5(uj, t, x) are left-continuous with right 
limits. _ 

(d) All paths 1 1— > svlQx^e ^rfj) anc ^ * l— * su Px-e£ ^7x) are l° can y bound- 
ed, where 7 is a (nonrandom) nonnegative function satisfying J E ('j(x)' 2 A 
l)X(dx) < 00. 

(e) All paths 1 1— > S' t (uj) are left-continuous with right limits on the semiopen 
set [0,t(cj)). 

(f) We have Jq |o" s | ds > a.s. for all t > 0. 
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The nondegeneracy condition (f) says that almost surely the continuous 
martingale part of X is not identically on any interval [0,t] with t > 0. 
Most results of this paper are true without (f), but not all, and in any 
case this condition is satisfied in all applications we have in mind. Apart 
from this nondegeneracy condition, Assumption 1 accommodates virtually 
all models for stochastic volatility, including those with jumps, and allows for 
correlation between the volatility and the asset price processes. For example, 
if we consider a d-dimensional equation 



where Z is a multidimensional Levy process with Gaussian components, and 
/ is a C 2 function with at most linear growth, then any of the components of 
Y satisfies Assumption 1 except perhaps (e); this comes from Ito's formula 
and from the representation of a Levy process in terms of a Wiener process 
and a Poisson random measure. The same holds for more general equations 
driven by a Wiener process and a Poisson random measure. 

Except when stated otherwise, we will maintain Assumption 1 throughout 
the paper and will therefore omit it in the statements of the results that 
follow. 

3. The statistical problem. 

3.1. Preliminaries. Our process X is discretely observed over a given 
time interval [0, t] , and it is convenient in practice to allow t to vary. So we 
suppose that X can be observed at times iA n for alH = 0, 1, . . . and we take 
into account only those observation times iA n smaller than or equal to t, 
whether t is a multiple of A n or not. Moreover the testing procedures given 
below are "asymptotic," in the sense that we can specify the level or the 
power function asymptotically as n — > oo and A n — > 0. 

Recall that we want to decide whether there are jumps or not, for the 
process (1); equivalently, we want to decide whether the coefficient 5 is 
identically or not. A few remarks can be stated right away, which show 
some of the difficulties or peculiarities of this statistical problem: 

1. It is a nonparametric problem: we do not specify the coefficients b, a, 5. 

2. It is an asymptotic problem, which only makes sense for high-frequency 
data. 

3. When "n is infinite," that is, in the ideal although unrealistic situation 
of a complete observation of the path of X over [0, t] , we can of course 
tell whether our particular path has jumps or not. However, when the 
measure A is finite there is a positive probability that the path X(co) has 
no jump on [0,t], although the model itself may allow for jumps. 



(5) 
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4. In the realistic case n < oo we cannot do better than in the "completely 
observed case." That is, we can hopefully infer something about the jumps 
which actually occurred for our particular observed path, but nothing 
about those which belong to the model but did not occur on the observed 
path. 

5. Rather than "testing for jumps," there are cases in practice where one 
wants to estimate in some sense the part of the variability of the process 
which is due to the jumps, compared to the part due to the continuous 
component. This is what most authors have studied so far, in some special 
cases at least, and we will take a systematic look at this question in our 
further paper in preparation on the topic, using essentially the same tools 
as in the present paper. However, under the null hypothesis that jumps are 
present, it is not clear how one should go about specifying the proportion 
of quadratic variation attributable to jumps without already assuming 
not only the type but also the "quantity" of jumps, for instance when 
using statistics such as those in Barndorff-Nielsen and Shephard (2006). 
The test we propose below does not have this problem. 

6. Coming back to testing for jumps, an important property of test statistics 
is that they should be scale-invariant (invariant if X is multiplied by an 
arbitrary constant). It would also be desirable for the limiting behavior 
of the statistic to be independent of the dynamics of the process. We will 
see that our test has all these features. 

In view of comments 3 and 4 above, the problem which we really try to 
solve in this paper is to decide, on the basis of the observations Xi& n which 
belong to the time interval [0, t], in which of the following two complementary 
sets the path which we have discretely observed falls: 



whereas if we decide on fi£ it does not mean that the model is continuous, 
even on the interval [0,t] (of course, in both cases we can say nothing about 
what happens after t\). 

3.2. Measuring the variability of X. Let us now introduce a number 
of processes which all measure some kind of variability of X, or perhaps 
its continuous and jump components separately, and depend on the whole 
(unobserved) path of X: 





If we decide on Qi 



then we also implicitly decide that the model has jumps 





s<t 



where p is a positive number. 
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Note that A(p) is finite- valued for all p > (under Assumption 1), where- 
as B(p) is finite- valued if p > 2 but often not when p < 2. Also, recall that 
[X,X] =A{2) + B{2). 

We have £l J t = {B(p)t > 0} for any p > 0, so in a sense our problem "sim- 
ply" amounts to determining whether B(p)t > for our particular observed 
path, and with any prespecified p. Moreover a reasonable measure of the rela- 
tive variability, or variance, due to the jumps is B(2) t /[X,X] t , and this is the 
measure used, for example, by Huang and Tauchen (2006); other measures 

of this variability could be B(p)t/A(p)t or B(p)t/[X, X]^ 2 for other values 
of p (the power p/2 in the denominator is to ensure the scale- invariance). 

In any event, everything boils down to estimating, on the basis of the 
actual observations, the quantity B(p) t in (7), and the difficulty of this 
estimation depends on the value of p. Let 

( 8 ) A?X = X iAn - ^(i-i)A„ 

denote the observed discrete increments of X [all of them, not just those 
due to jumps, unlike (3)] and define for p > the estimator 

[t/A„] 

(9) B(p,A n ) t := \ A i X \ P - 

i=i 

For r G (0,oo), let 

(10) m r =E{\U\ r )=ir- 1/2 2 r / 2 r(^^ S j 

denote the rth absolute moment of a variable U ~ iV(0, 1). We have the 
following convergences in probability, locally uniform in t: 



(11) 



p>2 B(p,A n ) t ^B(p) t , 
p = 2 =► B(p,A n ) t ^[X,X] t , 

A l-p/2 

P<2 => — B(p,A n ) t ^A(p) t , 

m p 

X is continuous B(p, A n ) t — > A(p) t . 



These properties are known: for p = 2 this is the convergence of the re- 
alized quadratic variation, for p > 2 this is due to Lepingle (1976) for all 
semimartingales, and for the other cases one may see, e.g., Jacod (2008). 

The intuition for the behavior of B(p,A n ) t is as follows. Suppose that X 
can jump. Among the increments of X, there are those that contain a large 
jump and those that do not. While the increments containing large jumps are 
much less frequent than those that contain only a Brownian contribution and 
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many small jumps, or only a Brownian contribution when A is finite, they 
are so much bigger than the rest that when jumps occur, their contribution 
to B(p) for p > 2 overwhelms everything else. This is because high powers 
(jp > 2) magnify the large increments at the expense of the small ones. Then 
the sum behaves like the sum coming from the jumps only; this is the first 
result in (11). When p is small (p < 2), on the other hand, the magnification 
of the large increments by the power is not strong enough to overcome the 
fact that there are many more small increments than large ones. Then the 
behavior of the sum is driven by the summation of all these small increments; 
this is the third result in (11). When p = 2, we are in the situation where 
these two effects (magnification of the relatively few large increments vs. 
summation of many small increments) are of the same magnitude; this is 
the second result in (11). When X is continuous, we are only summing 
small increments and we get for all values of p the same behavior as the one 
in the third result where the summation of the small increments dominates 
the sum; this is the fourth result in (11). 

3.3. The test statistics. Based on this intuition, upon examining (11), we 
see that when p > 2 the limit of B(p, A n )t does not depend on the sequence 
A n going to 0, and it is strictly positive if X has jumps between and t. 
On the other hand, when X is continuous on [0,t], then B(p,A n )t converges 
again to a limit not depending on A n , but only after a normalization which 
does depend on A n . 

These considerations lead us to compare B(p,A n )t on two different A n - 
scales. Specifically, we choose an integer k and compare B(p,A n )t with 
B(p,kA n )t, the latter obtained by considering only the increments of X 
between successive multiples of kA n . Then we set 

(12) S{ P ,k,An)t = S ^ kAn)t 

{ ' ^ nH B(p,A n ) t 

as our (family of) test statistics. 

In view of the first and fourth limits in (11), we readily get: 

Theorem 1. Let t > 0, p> 2 and k>2. Then the variables S(p,k,A n ) t 
converge in probability to the variable S(p, k)t defined by 



(13) S(p,k) t 



\ 1, on the set Cl J t , 

\k p / 2 -\ on the set Of. 



(On the set Of, the convergence holds for p < 2 as well.) Therefore our test 
statistics will converge to 1 in the presence of jumps and, with the selection 
of p = 4 and k = 2, to 2 in the absence of jumps. The following corollary is 
immediate: 
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Corollary 1. Let t > 0, p > 2 and k>2. The decision rule defined by 

/~v / \ (~v, / i ,x I X is discontinuous on \0,t], if S(p,k, A n ) t < a, 
<s{n) = <s{n,t,p,k) = < 11 ~; ' 

I J is continuous on [0,t], if S(p,k,A n )t> a 

is consistent, in the sense that the probability that it gives the wrong answer 
tends to as A n — > 0, for any choice of a in the interval (1, k v l 2 — 1). 



4. Central limit theorems. 



4.1. CLT for power variations. The previous corollary provides the first 
step toward constructing a test for the presence or absence of jumps, but it is 
hardly enough. To construct tests, we need to derive the rates of convergence 
and the asymptotic variances when X jumps and when it is continuous. 

We start with the following general theorem, to be proved in Section 8 
(as above, k is an integer larger than 1). The asymptotic variances in the 
theorem involve the process A{p)t already defined in (7) as well as the more 
complex process 

(14) D(p) t = J2\^X s \^a 2 s _ + a 2 s ) 

s<t 

for p > 0. Like B(p) t , D(p) t is finite-valued if p > 2 but often not when p < 2. 

Theorem 2. (a) Let p > 3. For any t>0 the pair of variables 

A-^iB^A^t - B(p) t ,B(p, kA n ) t - B(p) t ) 

converges stably in law to a bidimensional variable of the form (Z(p)t, Z(p)t + 
Z'(p,k)t), where both Z{p)t and Z'(p,k)t are defined on an extension 
{^t)t>Qi IP) of the original filtered space (0, J 7 , {Ft)t>o-, P) and conditionally 
on J- are centered, with Z'(p)t having the following conditional variance: 

(15) nz'(p,k) 2 t | F) = ^lp 2 D(2p-2) t . 

Moreover if the processes a and X have no common jumps, the variable 
Z'{p)t are T- conditionally Gaussian. 

(b) Assume in addition that X is continuous, and let p > 2. The pair of 
variables 

A- 1 / 2 ^/ 2 B(p, A n ) t - m p A(p) t ,Ai~^ 2 B(p, kA n ) t - k^ 2 - l m p A{p) t ) 

converges stably in law to a bidimensional variable (Y \p)t,Y' (p, k)t) defined 
on an extension (0,.F, (J r t)f>o,IP) of the original filtered space (J 7 t)t>o,^') 
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and which conditionally on J 7 is a centered Gaussian variable with variance- 
covariance given by 



for U , V independent N(0, 1) variables. 

The "stable convergence in law" mentioned above is a mode of conver- 
gence introduced by Renyi (1963), which is slightly stronger than the mere 
convergence in law, and its important feature for us is that if V n is any 
sequence of variables converging in probability to a limit V on the space 
(0, JT, (J r t ) t > ,P), whereas the variables converge stably in law to V , 
then the pair (V n ,V^) converges stably in law again to the pair (V, V). 

The property "the processes a and X have no common jumps" may hold, 
even though both processes are driven by the same Poisson measure; indeed, 
it holds when the product 55 vanishes identically, or more generally when 
(55)(lo, s, z) = for ¥(duj) x ds x \{dz) almost all (uj,s,z). Because of the 
freedom we have in the choice of the driving Poisson measure, in this case 
we can also use two independent Poisson measures fi and ju to drive a and 
X, without changing the law of the pair (X, a), whereas conversely if this is 
true we can "aggregate" the two measures \x and Jl into a single one. 

The reader will observe the restriction p > 3 in (a) above: the CLT simply 
does not hold if p < 3, when there are jumps. On the other hand, (b) holds 
also if p < 2, under the additional assumption that a does not vanish, but 
we do not need this improvement here. 

4.2. CLT for the nonstandardized statistics. This theorem allows us to 
deduce a CLT for our statistics S(p,k,A n ) t : 

Theorem 3. (a) Letp>3 and t > 0. Then A„ 1 ^ 2 (S(p, k, A n ) t - 1) con- 
verges stably in law, in restriction to the set Q? t of (13), to a variable S(p, k)\ 
which, conditionally on T , is centered with variance 



(16) 



r E(Y(p)l\F) = (m 2p -rn 2 p )A(2p) t , 
< E(Y'(p, kf t | F) = kr>~ l {m 2p - m 2 p )A(2p) t , 
, E(Y(p) t Y'(p, k) t | T) = {m kiP - kP/ 2 m 2 p )A(2p) t 



and where 



(17) 



m k)P = E{\U\ P \U + Vk~^lV\ p ) 



(18) 



E((S(p,k)i) 2 \F) 



(k-l)p 2 D(2p-2) 
2~ BW 



Moreover if the processes a and X have no common jumps, the variable 
S(p, k)\ is T -conditionally Gaussian. 
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(b) Assume in addition that X is continuous, and let p > 2 and t > 
0. Then A n 1 ^ 2 (S(p, k, A n )t — kP/ 2 ' 1 ) converge stably in law to a variable 
S(p, k)1 which, conditionally on T , is centered normal with variance 



k)tf\f) = M(p,k)^}, 
where 

(20) M(p, k) = -^(F" 2 (l + k)m 2p + kP~\k - l)ml - 2k p l 2 - l m k)P ). 

Note that (a) and (b) are not contradictory, since £l J t = 0, when X is 
continuous. It is also worth noticing that the conditional variances (18) and 
(19), although of course random, are more or less behaving in time like 1/t. 

4.3. Consistent estimators of the asymptotic variances. To evaluate the 
level of tests based on the statistic S(p, k, A n ) t , we need consistent estimators 
for the asymptotic variances obtained in Theorem 3. That is, we will need 
to estimate D{p) when p>2 and when there are jumps, and also for A(p) 
when p > 2 and X is continuous. 

To estimate A(p), we can use a realized truncated pth variation: for any 
constants a > and w € (0, i), we have from Jacod (2008) that, if either 
p = 2, or p > 2 and X is continuous, then 

a1-p/2 [*/An] 

(21) A(p, A n ) t := _= \^iX\ P l{\A n X\<aA%} A(p) t . 

m p i=l 

Alternatively, we can use the multipower variations of Barndorff-Nielsen 
and Shephard (2006). For any r G (0, oo) and any integer q > 1, we have 
from Barndorff-Nielsen et al. (2006a) that if X is continuous 

a1-9T/2 [t/A n ]-q+l q 

(22) A(r,q,A n ) t :=^^- ^ I ^'-1^ ^ A(qr) t 

[when q = 1, we have A(r, 1, A n ) = (A« r ^ 2 /m r )B(r, A n )]. 

It turns out that we also need some results concerning the behavior of 
A(p, A n ) t or A(r, q, A n ) when X is discontinuous. These results will be stated 
below, together with the consistency of the estimators of D(p)t which we 
presently describe. Estimators for D(p)t are a bit more difficult to construct 
because we need to evaluate o~ 2 _ and o 2 when s is a jump time, and this 
involves a kind of nonparametric estimation. A possibility, among many 
others, is as follows: take any sequence k n of integers satisfying 

(23) k n -> oo, k n A n -> 
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and then let I n ,t{i) = {j G N : j ^ i : 1 < j < [t/A n ] — < k n } define a local 
window in time of length 2k n A n around time iA n and 

i [*/A»] 

(24) £>( p ,A n ) t = — - £ | A? AT £ (Ap0 2 l { | A „ X | <QA?r} , 
n n i=l je/„, t (i) 

where a > and G (0, 1/2). 

The following theorem establishes the consistency of these estimators, and 
also contains the technical result (25) whose proof goes along the same lines, 
and which is needed for the consistency of the tests given below when the 
null hypothesis is that X does not jump. 

Theorem 4. (a) We have 
p>2, t>0, a>0, 



(25) 



1 1 1 ^ r A n A(2p, A n ) t p 
<vj < - =>• lim sup — ► U, 

2 P 2 n [ A{p,A n f t 



(26) rG (0,2), 9 GN =► l(r, g, A n ) t A(qr) t , 

locally uniformly in t. 

(b) If a > 0, zu G (0, 1/2) and p> 2, we have 

(27) B(p,A n ) t ^£>(p) t Vt>0. 
7/ further X is continuous, then 

(28) A^~ p / 2 D(p, A n ) t m p A(p + 2) t locally uniformly in t. 



Note that (27) and (28) are not contradictory, because D(p) = when 
A is continuous. In fact we have more than (25), namely the convergence 
(21) when there are jumps, but only under some restrictions on the jumps 
and also some restrictions on w connected with the structure of the jumps 
and the value p > 2; but these refinements are useless here. Of course (26) 
reduces to (22) when X is continuous. 



4.4. CLT for the standardized statistics. Using Theorem 4, we can im- 
mediately construct consistent estimators of the asymptotic variances of the 
test statistics established in (18) and (19), respectively. We deduce from 
(11), (26) and (27) the following CLT for the standardized test statistics 
[recall f \a s \ p ds > a.s., by (f) of Assumption 1]: 
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Theorem 5. (a) Let p>3 and t > 0. With 

(0Q , & _ A n (k-l)p 2 D(2p-2,A n ) t 

2B(p, A n )f 

the variables (V^ t )^ 1 ^ 2 (S(p, k, A n )t — 1) converge stably in law, in restriction 
to the set of (6), to a variable which, conditionally on T , is centered with 
variance 1, and which is N(0, 1) if in addition the processes a and X have 
no common jumps. 

(b) Assume in addition that X is continuous, and let p > 2 and t>0. The 
variables {V^ t )~ 1 / 2 {S{p, k, A n )t — k p / 2 ~ l ) converge stably in law to a variable 
which, conditionally on T , is N(0, 1), where V^ t is based on truncations: 

(30) ^ A n M { p,k)A(2 P ,A n ) t 

A(p,A n ) 2 

or is replaced by the multipower estimator: 

m) y C _ A n M(p,k)A(p/([p] + l),2[p} + 2,A n ) t 

A{p/{\p] + l),[p] + l,A n f t 

In (31) we have chosen r =p/([p] + 1) and respectively q = 2[p] + 2 and 
q = [p] + 1. Any other choice with r £ (0,2) and respectively q = 2p/r and 
q= p/r would do as well. 

5. Testing for jumps. We now use the preceding results to construct 
actual tests, either for the null hypothesis that there are no jumps, or for 
the null hypothesis that jumps are present. 

5.1. When there are no jumps under the null hypothesis. In a first case, 
we set the null hypothesis to be "no jump." We choose an integer k > 2 and 
a real p > 3, and associate the critical (rejection) region of the form 

(32) C c n>t = {S(p,k,A n ) t <c c n>t } 

for some sequence c c n t > 0, possibly c c nt = c% for all n, and possibly even a 
random sequence. 

The customary way for defining the asymptotic level of this test is as fol- 
lows: if a^ t (b,a,5) =P(C^ t ), a notation which emphasizes the dependency 
on the coefficients (6, a, 5) of (1), one should take the supremum over all 
triples (b, a, 5) in the null hypothesis (i.e., with 5 = 0) of limsup n a™(&, cr, 5). 
However, as mentioned at the end of Section 3.1, we only observe a particular 
path s i — > X s (lv) over [0, t], and even only at times iA n , so there is obviously 
no way of statistically separating the genuine null hypothesis from the case 
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where there are jumps, but none occurred in the interval [0,t]. Therefore, 
recalling the sets 0£ and £l J t of (6), it is natural to take the following as our 
definition of the asymptotic level: 

(33) a = sup lim sup P(C£ t \ Q%) , 

b,a,S n 

with the convention that P(- | Of) = if P(0£) = 0. Observe that we take 
first the lim sup and next the supremum; should we proceed the other way 
around, we would find a = 1. In a similar way, the power function for the 
coefficients triple (b, c, 5) is the following conditional probability: 

(34) p c n>t (b,a,6)=F(C* jt \ni). 

The right-hand side above makes sense as soon as the function 5 in restriction 
to 0, x [0, t] x E is not ¥(dw) ds A(dr)-almost everywhere vanishing, since 
F(n J t ) > in that case, whereas otherwise P(f2f > 0) = and our process 
is continuous on [0, t] (since our setting is nonhomogeneous in time, a test 
based on observations on [0, t] can of course say nothing about what happens 
after time t). 

For a € (0, 1), denote by z a the a-quantile of N(0, 1), that is, ¥(U > z a ) = 
a, where U is iV(0, 1). We have: 

Theorem 6. Let t > 0, choose a real p > 3 and an integer k > 2, and 

let 

(35) c^F/ 2 " 1 -^^, 

where V£ t is given by (30) with a > and 1/2 — 1/p < w < 1/2, or is replaced 
by % of (31). Then: 

(a) The asymptotic level (33) of the critical region defined by (32) for 
testing the null hypothesis "no jump" equals a. 

(b) The power function (34) satisfies P^ t (b,a,8) — ► 1 for all coefficients 
(b,o~,5) such that P(f^) > (and of course satisfying Assumption 1). 

5.2. When jumps are present under the null hypothesis. In a second case, 
we set the null hypothesis to be that "there are jumps," in which case the 
critical (rejection) region is of the form 

(36) Ci it = {S(p,k,A n ) t >ci tt } 

for some sequence c l nt > 0. As in (33), the asymptotic level is 

(37) a' = suplimsupP(C^ \n{), 
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with the convention that P(- | Q 3 t ) = if P(fif) = 0, and the power function 
for the coefficients triple (b, a, 5) is 

(38) /3i jt (b,a,S)=¥(Cl jt \nt). 

The right-hand side above is simply P(C^ t ) when 5 = 0, and it makes sense 
as soon as P(fi£) > 0, whereas when P(fi£ ) = 0, then the null hypothesis is 
of course satisfied. 

Theorem 7. Let i > and choose a real p > 3 and an integer k>2. Let 
a €(0,1) and w£ (0,1/2). 

(a) Letting 

(39) < t = 1 + \jvL/a, 

where V^ t is given by (29), the asymptotic level (37) of the critical region 
defined by (36) for testing the null hypothesis "there are jumps" is smaller 
than a. 

(b) Suppose that we restrict our attention to models in which the processes 
X and a have no common jumps. If 

(40) <t = l + W%> 

then the asymptotic level (37) of the critical region defined by (36) for testing 
the null hypothesis "there are jumps" is equal to a. 

(c) In all cases the power function (38) satisfies f3^ t (b,a,S) — > 1 for all 
coefficients (b,o~,8) such that P(fi£) > 0. 

5.3. Choosing the free parameters p, k, a and w. In both Theorems 6 
and 7 one has two "basic" parameters to choose, namely p > 3 and the 
integer k>2. For the standardization, and except if we use V£ t in Theorem 
6, we also need to choose a > and w, which should be in (1/2 — 1/p, 1/2) 
for the first theorem, and in (0, 1 /2) for the second one. Let us give some 
remarks regarding the selection of these parameters, and the impact of the 
choices on the test. 

First, about the real p > 3: the larger it is, the more the emphasis put on 
"large" jumps. Since those are in any case relatively easy to detect, or at 
least much easier than the small ones, it is probably wise to choose p "small." 
However, choosing p close to 3 may induce a rather poor fit in the central 
limit theorem for a number n of observations which is not very large, since 
for p = 3 there is still a CLT, but with a bias. A good compromise seems to 
be p = 4, since in addition the computations are easier when p is an integer. 

Second, about k: when k increases, we have to separate two points (1 and k 
when p = 4) which are further and further apart, but on the other hand in 
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(18) and (19) we see that the asymptotic variances are increasing with k. 
Furthermore, large values of k lead to a decrease in the effective sample size 
employed to estimate the numerator B(p, kA n )t of the test statistic, which is 
inefficient. So one should choose k not too big. Numerical experiments with 
k = 2,3,4 have been conducted below, showing no significant differences for 
the results of the test itself. More experiments should probably be conducted; 
however, we think that the choice k = 2 is in all cases reasonable. 

Third, about a and w. from a number of numerical experiments, not 
reported here to save space, it seems that choosing w close to 1/2 (as w = 
0.47 or w = 0.48), and a between 3 and 5 times the "average" value of a, 
leads to the left-hand sides of (27) and (28) being very close to the right- 
hand sides, for relatively small values of n. Of course choosing a as above 
may seem circular because at is unknown, and usually random and time- 
varying, but in practice one very often has a pretty good idea of the order 
of magnitude of at, especially for financial data. Even if no a priori order of 
magnitude is available, it is possible to estimate consistently the volatility 
of the continuous part of the semimartingale, (J a 2 s ds) 1 ^ 2 , in the presence 
of jumps; see the literature on disentangling jumps from diffusions cited 
in the Introduction. The multipower variations (22) do not suffer from the 
drawback of having to choose a and w a priori, but they cannot be used 
for Theorem 7, and when there are jumps the quality of the approximation 
in (26) strongly depends on the relative sizes of a and of the cumulated 
jumps. 

Finally, let us calculate more explicitly the critical regions for the val- 
ues p = 4. We obtain that M(4, 2) = 160/3, and more generally M(4, k) = 
16k(2k 2 — k — l)/3, and the cut-off point in (35) when further k = 2 becomes 




= 2 



2, 




160A n yl(8,A n ) f 
3A(4,A n ) 2 



or 



(41) 




= 2 



2, 



a 



160A n A(4/5,10,A n ) t 
3i(4/5,5,A n )f 



In a similar way, (40) becomes 



(42) 



c' 



l + z, 



8A n D{6,A n ) t 



'n,t 




The a-quantiles of N(0, 1) at the 10% and 5% level are Zq.i = 1-28 and 
-20.05 = 1-64, respectively. 
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5.4. The effect of micro structure noise. As already said before, the ob- 
servations of the process X are blurred with a (small) noise, which messes 
things up when data are recorded with high-frequency. 

We do not intend to provide a deep study of this topic here, but merely 
to establish some basic facts from the point of view of the consistency of our 
statistics (rates of convergence are more difficult to obtain). We assume that 
each observation is affected by an additive noise, that is, instead of Xi& n we 
really observe 5^a„ — ^iA n + £i, and the e$ are supposed to be i.i.d. with 
E(e\) and E(sf) finite. Then, instead of £>(4, A n ) t (we consider the case 
p = 4 here), we actually observe (for k = 1 and also k>2): 

[t/kA n ] 

B' (4, kA n ) t = ^2 (XikA n - ^(i-i)fcA„ + £fci - 

i=l 

[t/kA n ] 

= B(A,kA n ) t + 2 ^2 ( x ikA n - X(i_i) kAn ) (efcj - £fe(j_i)) 

i=l 

[t/kA n ] 

1=1 

The second term above behaves like 4:E(ef)B(2,kA n ) t and the third one 
like t{2E{ej) + QE(e^) 2 )/(kA n ). It follows that the statistics S'(4,k,A n ) t = 
-B'(4, kA n ) t /B'(4:, A n ) t that we actually observe instead of (12) has the fol- 
lowing behavior, as A n — ► 0: 

(43) S'(4,k,A n ) t -^± 

The relevance of this limit will become clear when we apply the test to real 
data in Section 7 below. Finally, note that when A n is moderately small, 
things may be different: if E(ef) and E(ef) are small (as they are in practice), 
S"(4, k, A n )t will be close to 1 on Q J t and to k on 

6. Simulation results. Throughout the simulations, we use p = 4. We 

calibrate the values to be realistic for a liquid stock trading on the NYSE. 
We use an observation length of t = 1 day, consisting of 6.5 hours of trading, 
that is, 23,400 seconds. The simulations contain no microstructure noise. 

Under the null of no jumps, the performance of the test in simulations 
is reported in Table 1. The distribution of the test statistic in simulations 
is close to the theoretical normal limit centered at k in simulations; his- 
tograms for the test statistic and the corresponding asymptotic distribution 
are shown in Figure 1 for the nonstandardized (top panel) and standard- 
ized test statistics (lower panel) and the cases k = 2 (left panel) and k = 3 



18 



Y. AIT-SAHALIA AND J. JACOD 



Table 1 







Level 


of the test under 


the null hypothesis 


of no jumps 




A„ 


n 


k 


Mean value of S(4,k, A„) 


Rejection rate in simulations 


Asymptotic 


Simulations 


10% 


5% 


1 sec 


23,400 


2 


2 


2.000 


0.098 


0.046 


1 sec 


23,400 


3 


3 


2.998 


0.099 


0.048 


1 sec 


23,400 


4 


4 


3.999 


0.094 


0.043 


5 sec 


4680 


2 


2 


1.998 


0.099 


0.044 


5 sec 


4680 


3 


3 


2.998 


0.099 


0.043 


5 sec 


4680 


4 


4 


3.995 


0.095 


0.038 


10 sec 


2340 


2 


2 


2.000 


0.093 


0.041 


15 sec 


1560 


2 


2 


1.999 


0.090 


0.043 


30 sec 


780 


2 


2 


2.007 


0.085 


0.035 



Note: This table reports the results of 5000 simulations of the test statistic under the 
null hypothesis of no jumps. The data generating process is the stochastic volatility model 
dXt/Xt = a t dWt, with a t = v] /2 , dv t = nifi-Vt) dt + ^v] /2 dB t , E[dW t dB t ) = pdt, /3 1/2 = 
0.4, 7 = 0.5, n = 5, p = —0.5. The parameter values are realistic for a stock based on 
Ait-Sahalia and Kimmel (2007). The test statistic is standardized with the estimator of 
V^ }t given in (30) with a = 5/3 1/2 and w = 0.47 in (21). 

(right panel.) The differences between the cases k = 2 and k = 3 conform to 
the theoretical arguments given above in Section 5.3, in terms of trade-off 
between variance and separation of the modes of the distributions. 

Under the null that jumps are present, the test statistic is now centered 
around the predicted value of 1. We report in Table 2 the performance of 
the test statistic when jumps are compound Poisson. Figure 2 plots the 
distribution of the test statistic for different values of the jump arrival rate, 
A. An interesting phenomenon happens, when multiple jumps can occur with 
high probability (A high). While the likelihood that jumps will take place in 
successive observations is small, such paths will happen over a large number 
of simulations. When that situation happens, we may see in U(4, 2A„)t the 
two jumps either compensate each other, if they are of opposite signs, or 
cumulate into a single larger jump if they are of the same sign. On the other 
hand, no such compensation or cumulation takes place in -6(4, A n )t- As a 
result, their ratio 5(4, 2, A n )t can exhibit a small number of outliers. 

Similar results for the case where jumps are generated by a Cauchy pro- 
cess are reported in Table 3 and in Figure 3 for the nonstandardized (top 
panel) and standardized test statistics (bottom panel), for different values 
of the Cauchy scale parameter 9. The contrast between the top (very far 
from normal) and bottom (nearly normal) panels in Figure 3 illustrates the 
role of the standardization in making the test statistic asymptotically nor- 
mal: recall from Theorem 3(a) that the nonstandardized statistic S(4, 2, A n )t 
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No Jumps: Distribution of the Statistic 




Fig. 1. Monte Carlo and theoretical asymptotic distributions of the non- standardized 
(top row) and standardized (bottom row) test statistics 5(4, k, A„) for k = 2,3 and A„ = 1 
second under the null hypothesis of no jumps. In the standardized case, the solid curve is 
the N(0, 1) density. 

is in general non-Normal unconditionally, whereas from Theorem 5(a) the 
standardized statistic (V^ t )~ 1 ^ 2 (S(4:,2,A n )t - 1) is asymptotically N(0, 1) 
unconditionally. 

In Figure 4, we illustrate what happens when there are paths containing 
either no jumps or jumps that are too tiny to be detected as jumps. In 
the Poisson case (left panel), we set A to correspond to 1 jump per day 
on average, but keep all paths, including those where no jump took place. 
Such paths with no jumps occur with positive probability, unlike the Cauchy 
case. But since we condition on having jumps, we removed those paths in 
Figure 2 to compute the distribution of the statistic under the null that 
jumps are present, as required under Theorem 5(a); this is the practical 
implication of the restriction to the set Q? t . Now, when those paths are kept, 
we obtain a clear-cut bimodal result: either no jump occurred, and those 
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Table 2 

Compound Poisson jumps: Level of the test under the null hypothesis that jumps are 

present 



Mean value of S(4, k, A„) Rejection rate in simulations 





n 


k 


Asymptotic 


Simulations 10% 


5% 










A = 1 jump per day 




1 sec 


23,400 


2 


1 


1.000 0.110 


0.056 


4 sec 


5,850 


2 


1 


1.002 0.107 


0.054 


15 sec 


1560 


2 


1 


1.003 0.100 
A = 5 jumps per day 


0.053 


1 sec 


23,400 


2 


1 


1.002 0.113 


0.057 


4 sec 


5850 


2 


1 


1.006 0.112 


0.059 


15 sec 


1560 


2 


1 


1.012 0.129 
A = 10 jumps per day 


0.071 


1 sec 


23,400 


2 


1 


1.002 0.105 


0.056 


4 sec 


5850 


2 


1 


1.009 0.134 


0.076 


15 sec 


1560 


2 


1 


1.029 0.163 


0.099 



Note: This table reports the results of 5000 simulations of the test statistic under the null 
hypothesis that jumps are present. The model under the null is dXt/Xt = at dWt + Jt dNt, 
where a t is the same stochastic volatility process with the same parameter values as in 
Table 1, Jt is the product of a uniformly distributed variable on [—2,-1] U [1,2] times 
a constant Js and N is a Poisson process with intensity A. The total variance of the 
increments, a 2 + (7/3)j|A is held constant at 0.4 2 . As a result, jumps that are more 
frequent tend to be smaller in size. In the simulations, 25% of the total variance is due 
to the Brownian motion and 75% to the jumps. Since the test is conditional on a path 
containing jumps, paths that do not contain any jump are excluded from the simulated 
sample and replaced by new simulations. Thus, in sample, the number of jumps is slightly 
higher than specified by the value of A in the table. The test statistic is standardized with 
the estimator of V% t given in (29) with k n = [50A~ 1/4 ] in (24). 

paths are grouped around 2, or (at least) 1 jump occurred and those paths 
are grouped around 1. 

In the Cauchy case (right panel of Figure 4), when a small value of 9 is 
selected, although each path has infinitely many jumps, a sizeable proportion 
of paths has no jump large enough to make the path look markedly different 
from that of a pure Brownian motion at our observation frequency. As a 
result, we get a bimodal distribution with a second mode at the (continuous 
case) value of 2. Compared to the left panel of the figure, the infinite-activity 
of the process produces a more diffuse situation where some paths have only 
tiny jumps, and the statistic takes intermediary values between 1 and 2. 

Finally, we confirm in simulations that jumps in a do not affect the dis- 
tribution of the test statistic; as predicted by Theorem 5, this is always the 
case if X is continuous, and when X jumps, remains the case as long as a 
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Poisson: 1 Jump per Day 
Non- Standard! zed 



Poisson: 10 Jumps per Day 
Non - S t andardi zed 




0.95 



Poisson: 1 Jump per Day 
Standardized 



Poisson: 10 Jumps per Day 
Standardized 



0.4 




Fig. 2. Monte Carlo and theoretical iV(0, 1) asymptotic distributions of the standardized 
test statistic (V„ ) -1 / 2 (S(4, 2, A„) — 1) for A n = 1 second under the null hypothesis that 
jumps are present, using the same data generating process with Poisson jumps as in Table 



and X have no common jumps. To check this, we repeat the simulations 
above with jumps added to a that are independent from those of X (if any): 
we simulate v = a 2 from the model used in Table 1 plus proportional com- 
pound Poisson jumps that are uniformly distributed on [—30%, 30%]. The 
results are largely unchanged from those in the tables above, and are not 
reported here to save space. 

7. Empirical application. We now conduct the test for each of the 30 
Dow Jones Industrial Average (DJIA) stocks and each trading day in 2005; 
the data source is the TAQ database. Each day, we collect all transactions 
on the NYSE or NASDAQ, from 9 : 30am until 4 : 00pm, for each one of these 
stocks. We sample in calendar time every 5 seconds. Each day and stock is 
treated on its own. We use filters to eliminate clear data errors (price set to 
zero, etc.) as is standard in the empirical market microstructure literature. 
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Table 3 

Cauchy jumps: level of the test under the null hypothesis that jumps are present 



Mean value of S(4, k, A„) Rejection rate in simulations 



A„ 


n 


k 


Asymptotic 


Simulations 


10% 


5% 










Jump size: 


9 = 10 




1 sec 


23,400 


2 


1 


1.002 


0.112 


0.062 


5 sec 


4680 


2 


1 


1.010 


0.117 


0.066 


15 sec 


1560 


2 


1 


1.027 


0.126 


0.081 










Jump size: 6 


= 50 




1 sec 


23,400 


2 


1 


1.000 


0.090 


0.051 


5 sec 


4680 


2 


1 


1.002 


0.100 


0.059 


15 sec 


1560 


2 


1 


1.003 


0.094 


0.059 



Note: This table reports the results of 5000 simulations of the test statistic under the null 
hypothesis that jumps are present. The model under the null is dX t /X± = at dWt + 9dY t , 
where a t is the same stochastic volatility process as in Tables 1 and 2, Y is a Cauchy process 
standardized to have characteristic function E(exp(mYj)) = exp(— t\u\/2). The value of the 
long-run mean of volatility, P 1 ^ 2 , is 0.2; the other parameter values are identical. Given /3, 
the parameter 6 measures the size of the jumps relative to the volatility. The test statistic 
is standardized with the estimator of V£ t given in (29) with k n = [50A n 1/4 ] in (24). 

We plot in Figure 5 a histogram showing the empirical distribution of the 
statistic computed on these data. As expected, we see evidence of market 
microstructure noise in the form of density mass below 1 (the limit is 1/2, 
see Section 5.4, if the noise is i.i.d., but may be different with other kinds 
of noise). The striking feature of the results is that most of the observed 
values are around 1, providing evidence for the presence of jumps, with only 
few observations around 2, the expected limit in the continuous case. As we 
sample less frequently, the distribution spreads out, consistently with both 
the asymptotic theory and the simulations above. 

8. Proofs. 

8.1. Preliminaries. We use the shorthand notation E^ l _ 1 (Y) for E(Y \ 
f(i-i)A n ), and we set 

5? = °"(i-i)A n A™W, Of = |A™X|l{| A n X |< QA w} 

for a given pair a > and w 6 (0, ^). We will consider a strengthened version 
of Assumption 1: 

Assumption 2. We have Assumption 1, and \b t \ + \a t \ + \b t \ + \a t \ + 
\a' t \ < K and |5(i,x)| < 7(2;) and \6(t, x)\ < 7(2:) and also j(x) < K for some 
constant K. 
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Cauchy Jumps: Distribution of the statistic 



50 
40 

30 
20 
10 



Cauchy Jumps 6 = 10 
Non -Standardized 



Cauchy Jumps 9 = 50 
Non- Standardized 




200 ) 



150 



100 



50 ■ 




0.9 0.95 



1.05 1.1 



0.975 



1.025 



Cauchy Jumps 9 = 10 
Standardized 



Cauchy Jumps = 50 
Standardized 




Fig. 3. Monte Carlo distribution of the nons-tandardized test statistic S(4, 2,A n ) for 
A n = 1 second under the null hypothesis that jumps are present, using the same data 
generating process with Cauchy jumps as in Table 3. The asymptotic distribution of the 
nonstandardized statistic is nonnormal as expected (see upper panel). However, the asymp- 
totic distribution of the standardized test statistic is N(0,1) (the solid curve in the lower 
panel). 



A localization procedure shows that for proving Theorems 2 and 4 we 
can replace everywhere Assumption 1 by Assumption 2; this procedure is 
described in detail in Jacod (2008) and works with no change at all here, so 
we omit it. 

Now we state a number of consequences of this strengthened assumption, 
to be used a number of times in the following proofs. Below^ K denotes a 
constant which may depend on the coefficients (b, c, S) and (b,a,a' ,5') and 
which changes from line to line. We write it K a if it depends on an additional 
parameter a. We also write X' t and X' t ' for the sum of the first three terms, 
resp. of the last two terms, in (1). First, for all q > 2 we have the following 
classical estimates [proved, e.g., in Jacod (2008)] under Assumption 2: 

Eti(|A?Xf + KT) < K q Af/ 2 , Ef_ 1 (\A2X"\ q ) < K q A n , 
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Tiny Jumps or No Jumps: Distribution of the Statistic 
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Fig. 4. Monte Carlo distribution of the non- standardized test statistic S(4,2,A„) for 
A„ — 1 second, computed using a data generating process with either one Poisson jump 
per day on average including paths that contain no jumps (left panel) or tiny Cauchy jumps 
(6 = 1, right panel). 



Empirical Distribution of the Test Statistic: DJIA30 All 200S Trading Days 
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FlG. 5. Empirical distribution of the nonstandardized statistic S(4, 2,A n ) for different 
values of the sampling interval A n . Each sample point is computed using all the transac- 
tions for one of the 30 DJIA stocks observed over one trading day in 2005. This produces 
7560 realizations of the statistic. 
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(44) 

E?_i(|A?o-|«) < K q A n , Eti(|A?X' - 5m < KA^/ 2 . 

Next, the proof of Lemma 5.12 of Jacod (2008) applied with e being the 
supremum of the bounded function 7 shows that under Assumption 2, 

(45) EtrOA^'f A if) < KA n (^±^ + T(9) 

for all 7] > and 9 G (0, 1), and where T(9) — > as — > 0. Combining this 
with the second line in (44) yields that also 

(46) EtdlAfX - 5?\ 2 A if) < KA n + T(0)) ■ 

Finally let us also mention two inequalities, valid for all x,y £ M. and 
< e < 1 < A and for p > 2 and < r < 2: 



(47) 
and 

(48) 



\\ x + y\ Pl {\x+y\<e} ~ \x\ P \ 

< K p (\x\n {{x>£/2} + e^ 2 {y 2 A e 2 ) + \x\*-\\y\ A e)) 

\x + y\ r — \x\ r \ 

< K r (e r + Ae + A r ~ 2 {x 2 + y 2 ) + A r e~ 2 {y 2 A 1)). 



These inequalities are elementary, although a bit tedious to show: for the first 
one, one singles out three cases, namely the case \x\ > s/2, the case \x\ > e/2 
and \y\ > e/2, and the case \x\ < e/2 < |y|, and proves the inequality in each 
of the cases. The second one is proved analogously, after singling out five 
cases: the case |x| < \y\, the case \x\ > \y\ and \x\ > A, the case \y\ < \x\ < e, 
the case \y\ < e < \x\ < A, and the case e < \y\ < \x\ < A. 



8.2. Proof of Theorem 2. For proving Theorem 2 we need to exhibit the 
limits Z(p), Z'(p, k), Y(p) and Y'(p, k) and it takes some additional notation 
to do so. We consider an auxiliary space (ST, J-', P') which supports a number 
of variables and processes: 

• four sequences (U q ), (U' q ), (U q ), (U q ) of iV(0, 1) variables; 

• a sequence {n q ) of uniform variables on [0, 1]; 

• a sequence (L q ) of uniform variables on the finite set {0, 1, — 1} (k>2 
is the integer showing up in the theorem); 

• two standard Wiener processes W and Pv ; 

and all these processes or variables are mutually independent. Then we put 



S] = Ux!l', T = T®T', P = p ( g)P / 
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and we extend the variables Xt,bt, ■ ■ ■ denned on Q and W, U q , . . . denned 
on £1' to the product £1 in the obviousjway, without changing the notation. 
We write E for the expectation w.r.t. P. Finally, denote by (T n ) n >i an enu- 
meration of the jump times of X which are stopping times, and let (F) 
be the smallest (right-continuous) filtration of T containing the filtration 
(Ft) and w.r.t. which W and W are adapted and such that U n , U' n , U n , 
U n , K n and L n are Ft„ -measurable for all n. We hence get an extension 
(n,F,(F t )t>o,$) of the original space (fi, F, (^) t > ,P). Obviously, W, W, 
W, W are Wiener processes and \i is a Poisson measure with compensator 
v on (n,f,(F t ) t >o,F). 

Now we exhibit the limits. If (h s ) and (h' s ) are adapted processes with 
right-continuous or left-continuous paths, defined on (Q, J 7 , (JPj)t>o,P), we 
set 

(49) Y(h, h') t = f h s dW s + f h' s dW' s . 

Jo Jo 

This defines a local martingale on the extension, which conditionally on 
the ex-field T is a centered Gaussian martingale. Furthermore if (k s ,k' s ) is 
another pair of processes we have 

(50) E(Y(h, ti) t Y(k, k') t | F) = [ (h s k s + h'X) ds 

Jo 

as in Jacod (2008). In a similar way, with any function g on R which is 
locally bounded and with g(x)/x — > as x — > 0, we associate the following 
two processes: 

(51) Z(g) t = ]T g(AX Tq )(^JT q U q a Tq . + yfl- 7 q U' q a Tq ), 

q:T q <t 

(52) Z\g) t = g(AX Tq )(jr q U q a Tq - + y/k-l- L q V' q a Tq ). 

q:T q <t 

That Z(g) is well defined is proved in Jacod (2008) and the proof that Z'{g) 
is also well defined is exactly the same. Again, conditionally on these two 
processes are independent martingales with mean and 

E(Z(g)* | F) = \Y,9{^X s f(al_+a% 

s<t 

(53) 

s<t 

Moreover, conditionally on Y(h,h') and (Z(g),Z'(g)) are independent, 
and the latter is also a Gaussian martingale as soon as the processes X and 
a have no common jumps. 
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At this stage we can proceed to proving Theorem 2(a). As a matter of 
fact, it is clearer to prove a more general result: with k an integer strictly 
bigger than 1 and with any function / on R, we consider the two processes 

[t/A n ] 

v n (f) t = E f( A ? x )> 

(54) 

[t/kA n ] 

y n u)t= E m n ki - k+ iX+A n ki _ k+2 x+...+Aix). 

i=l 

We also write V(f)t = 2~2 s <t /(A-Xs), which is well defined as soon as f(x)/x 2 
is bounded on a neighborhood of 0. 

Theorem 8. Under Assumption 1, let f be C 2 with f(0) = /'(0) = 
/"(0) = (f and f" are the first two derivatives). The pair of processes 

{^ {vn{f)t " v ^wA n] ),^=(v n (f)t - n/W/A„])) 

converges stably in law, on the product ID(R + , R) x D(R+, R) of the Skorokhod 
spaces, to the process (Z(f'), Z(f') + Z'(f')). 

We have the (stable) convergence in law of the above processes, as ele- 
ments of the product functional space B(R + , R) 2 , but usually not as elements 
of the space D(R + ,R 2 ) with the (two-dimensional) Skorokhod topology, be- 
cause a jump of X entails a jump for both components above, but "with 
a probability close to j/k" the times at which these two components jump 
differ by an amount jA n , for j = 1, . . . , k — 1. 

Proof. As said in the beginning of this section, we can assume As- 
sumption 2. The proof is an extension of the proof of Theorem 2.12(i) of 
Jacod (2008). We start the proof under the additional assumption that / 
vanishes on [— 2ke, 2ke] for some e > 0. Let S q be the successive jump times 
of the Poisson process /u([0,t] x {x:~/(x) > e}). Let £l n (t,e) be the set of 
all uj such that each interval [0,t] n (iA n , (i + k)A n ] contains at most one 
S q (u>), and Si(w) > kA n and \X( i+1 ) An (u) - X iAn (u)\ < 2e for all i < t/A n 
and \X( i+ i} kAn (<jj) — X ikAn (u>)\ < 2e for all i < t/kA n . Next, on the set 
{{ik + j)A n < S q < (ik + j + l)A n } for i > 1 and < j < k, we set 

L(n,q)=j, 

K(n,q) = ^-(ik+j), 



A 



Oi-(n,q) = -=(W Sq - W {ik+j)Ari 

V 
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a+(n,q) = —^=(W {ik+j+1]An -W Sq ), 



P-(n,q) = -=(W {ik+j)An - W ikAn ), 



P+(n,q) = -^={W {i+1)kAn - W {ik+j+1)An ] 



X{e) t = X t - AX s g , 

q:S q <t 

R-(n,q) =X(e)( ik+j)An - X(e) ikAn , 
R' + (n,q) = X(e)( i+1 ) fcAn - X(e)( ik+j+1 ) An , 

R n q = Al h+j+1)An X(e), R' q n = R n q + R.(n, q) + R + (n, ■ 



Rq = ^/KqU q as q - + V 1 _ Kq U 'q°S q , 



R- (<?) = JWJ q a Sq - , R+(q) = Jk-l- L q u'a s , 



q^Zq- 

We can extend the proof of Lemma 6.2 of Jacod and Protter (1998) in this 

C— (s) 

more complicated context, and with — ► denoting the stable convergence 
in law, to obtain that 

(L(n, q),K(n, q),a>- (n, q),a + (n, q), (n, q),(3 + (n, q)) q >i 
C-(s) 



(Lq, K q , y/Kq^Uq, J 1 — K q U q , \J L q U q , \Jk — L q — 1U q ) q >i- 

We deduce from this, as in Lemma 5.9 of Jacod (2008), that 
{iq/y/E^, R- (n, q)/\f\ l , R+(n, g)/v^) 5>1 

(55) 

C -M } (R q , R-(q), R + (q))q>\. 

Now, since f(x) = if \x\ < 2ke, on the set Q n (t,e) and for all s < t we 
have 

V n {f) s - V(f) An[s/An] = ]T (f(^X Sq + R n q ) - f(AX Sq )) 

q:S q <A n [s/A n ] 

f'(Rq)Rq, 

q: S q <A n [s/A„] 

where R q is between AX$ q and AX$ q + R q - In a similar way, we get 

V n (f) s -VU)kA nl s/kA n] = E f'(Rq n X 

q : S q <kA n [s/kA n ] 
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with R'™ between AX Sq and AX Sq + R'f. Since fl", R-(n,q) and R+(n,q) 

go to 0, we deduce that iff and also go to (for each u), whereas 
£l n (t,e) — > ft. Since /' is continuous, we readily deduce the result of the 
theorem from (55). 

Now we turn to the general case, where / does not necessarily vanish 
around 0. With ip p a C 2 function equal to 1 on [— p, p] and to outside 
[—2/?, 2p] and with f p = fip p , we have for all r\ > 0: 

limhmsuppfsup|y"(/ p ) s -V(f p ) An[s/An] \/^/A~> V ) =0 

n \ s <t J 

[this is (5.48) in Jacod (2008)] and we obviously have the same for V™ . Since 
we have the stable convergence in law when we take the functions / — f„, 
and since obviously Z{f' p ) and Z'(f' p ) converge locally uniformly in time to 
Z(f') and Z'(f'), respectively, we readily deduce the stable convergence in 
law for the function /. □ 

We now prove part (a) of the theorem. 

Proof of Theorem 2(a) under Assumption 2. The previous the- 
orem applied with the function f{x) = \x\ p (recall p > 3) yields the stable 
convergence in law of the processes 

=(B(p,An) t - B{p) An[t/An] ),—^=(B(p,kA n ) t - B(p) kAn[t/kAn] y) 

toward (Z(f) t ,Z(f') t + Z'(f') t ), and f'(x) =p\x\P' 1 sign(x). Hence if Z'(p, 
k)t = Z f (f f )t, the formula (15) follows from (53), and it remains to prove that 

(B(p) t - B(p) kAn[t/kAn] )/y/A n — >0 for all integers k. Since \6{u,t,x)\ < 
7(x) < K we have 

E(S(p) f - B(p) kAn[t/kAn] ) =E( \ AX s\ P ) 

\kAn[t/kA n ]<s<t / 

= E[ / ds I \5(s,x)\ p \{dx) ) 

\JkA n [t/kA n ] JE J 



< 



ds [ j(x) p X(dx) < KkA n , 

JE 



/fcA„[t/fcA„] 

where K = J ^{x) p \{dx) is finite (recall p > 3), and the result follows. □ 

For Theorem 2(b) it is also convenient to prove a more general result. 
Let g = (<?j)i<j<2 be an M 2 -valued C 2 function on M fc with second par- 
tial derivatives having polynomial growth, and which is globally even [i.e., 



30 Y. AIT-SAHALIA AND J. JACOD 

g(— x) = g{x)\ and set 

[t/kAn] 

V n {g)t= £ g(/\t^ k+1 X/^/\t l _ k+2 X/^...AliX/^n). 

i=l 

(56) 

We also denote by p y the normal law N(0,y), and by Py® its /c-fold tensor 
product, and by Py®{f) the integral of any function / w.r.t. it. With the 
above assumptions we then have the following (a d-dimensional version is 
also available, of course): 

Theorem 9. Under Assumption 1, assume in addition that X is con- 
tinuous. Then 

(57) A n V n (g) t V(g) t ■= \ 

Furthermore the processes ,\ (A n V n (g) — V(g)) converge stably in law to 
the two-dimensional process (Y(h, 0), Y(h', h")) [recall (49)], where the pro- 
cesses (ht), (h' t ) and (h") are such that the matrix (^,* ®„) is a square root 

of the matrix Oj = {® % t)i<i,j<2 given by 

(58) = \{ P k ®{gi 9j ) - P k a?(gi)p k ®( 9j )). 

Proof. The proof is similar to the unipower case in Barndorff-Nielsen et al. 
(2006a). We indicate the changes that are necessary. Everywhere the sums 
over % from 1 to [nt] are replaced by sums from 1 to [t/kA n ]. In (4.1) 
of that paper we replace /3" = ^/nau^iy n A^W by the collection /3™(j) = 

a k(i-l)A n ^k(i-l)+j^ f° r J = 1) • • • > Then the same proof as for Propo- 
sition 4.1 of that paper shows that if 

[t/kAn] 

v n { g )= £ 0(/ar(i),/ap(2),-. ■.#(*))> 

1=1 

we have (57) and the stable convergence in law with V n {g) instead of V n (g), 
and the same process V(g). Next, similarly to Theorem 5.6 of that paper, 
and with Q = g(A ki _ k+1 X / VA^, • • • , A^X/yfA^), the processes 

_ [t/kAn] 

^A~ n (C-E(CI^-DaJ) 

i=l 

converge stably in law to (Y(h, 0), Y(h', h")), and this easily yields (57) for 
V n (g). 
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For the stable convergence in law, it remains to prove that the array 

a = vs:(m i ^<.-»aJ - db Ca. * (9) *) 

satisfies Efif" 1 ICT I °- This is proved as in Barndorff-Nielsen et al. 
(2006a); the fact that g is a function of several variables makes no real 
difference, and since g here is C 2 we are in the case of Hypothesis (K) of 
that paper [and not in the more complicated case of Hypothesis (K 7 )]. □ 

Finally, we prove part (b) of the theorem. 

Proof of Theorem 2(b) under Assumption 2. We simply apply 
the previous theorem to the even function g with components 

9i(xi,..., Xk) = \x\\ p H h \x k \ p , 92(2:1,. ■ -,x k ) = \xi-\ \-x k \ p 

(recall that p>2, hence g is C 2 ). The matrix Ot of (58) is then 

S t u = (m 2p - m 2 p )\a t \ 2p , = (m fe , p - k p / 2 m 2 p )\a t \ 2p , 

Y 22 = k p ~ 1 (m 2p -m 2 p )\a t \ 2p . 

Then a version of the triple (/i, /i', /i") is given by ht = ce\crt\ p and h' t = a'\at\ p 
and h'( = a"\at\ p , where 

a = \/m,2r — m 2 , a = — {m k r — k r ^ 2 m 2 ), 
v a 

a" = J k r ~ 1 (m2r — m 2 ) — aa! . 

Hence the result obtains, with Y(p) t = Y(h,Q)t and Y'(p,k)t = Y(h',h")t, 
upon noting that (16) follows from (50). □ 

8.3. Proof of Theorem 3. 

PROOF, (a) Write U n = {A n )~ 1 / 2 (B(p, A n ) t - B(p) t ) and V n = (A„)~ 1/2 x 
(B(p,kA n ) t -B(p) t ). Then 

B( P ,kA n )l_ 1/2 V n -U n 



s( P , k, A n ) t - 1 = y; / l t - 1 = (A n ) 



B(p,A n ) t B(p,A n ) t 

Theorem 3 yields that under Assumption 1 and if p > 3, then V n — U n con- 
verges stably in law to Z'(p,k)t, and the result follows from (15). 

(b) Write U' n = (A n )- 1 /2(Ai- p / 2 J §(p, A n )-m p A{p)) and K = (An)" 1 / 2 x 
(Al~ p/2 B(p,kA n ) - k p ' 2 - l m p A{p)). Then 

S(p, k, A n ) t - k p l 2 ~ x = ^' fcAn) " - A*/ 2 - 1 = (An) 1 ^! kP/2 ~ 1U " . 

B{p,A n ) t B(p,A n ) t 
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Since — k p l 2 ~ l U' n converges stably in law to Y'(p, k)t — k p / 2 ~ 1 Y(p)t when 
p > 2 and X is continuous, the result easily follows from (16). □ 

8.4. Proof of Theorem 4- 

Proof of (25) under Assumption 2. We fix p > 2, t > 0, w e (1/2 - 
1/p, 1/2) and a > 0. Apply (47) to get, for any B > 1, 

(59) \U n (B) - U' n \ < Kp(A n {B) + A' n (B) + <(£)), 
where 

[t/A n ] [t/A n ] 

u n (B) = Air^ J2 \Kx\ p h\A 7 x\<^ } , K = ^n Pl2 E \^ x '\ p 

i=l i=l 

and 

A n (B) = A* p l 2 E l'^r^T 1 {|A™X'|> v / 5A^/2}' 
i=l 

[t/An] 

<(i?)= J B^ 1 £ (|A?X"| 2 A£A re ), 

i=l 

[t/An] 

<(i?) = A^/ 2 £ lA-A'r^lA^'lAv/SA;). 

Then if we apply (44) and (45) with 7] = \JB/S. n and 9 = A^ 4 , and also 
Bienayme-Chebyshev for A n (B) and Cauchy-Schwarz for A'^(B), and with 

the notation e 2 n = Al/ 2 + r(A« //4 ) (so e n — > 0), we readily obtain (recall 
£ > 1): 

HMB)) < E«(B)) < K P tB p / 2 e 2 n , E«(fl)) < K p tB l / 2 e n . 

It follows that (with other constants A p ): 
P(4,(fl) > 5- 1 / 2 ) < Jf^S" 1 /*, P(A' n ( J B) + > £ y 2 ) < A p feV 2 . 

On the other hand we can apply the last property in (11) to X' , which 

IP 

satisfies Assumption 2 and is continuous, to get U n — ► m p A(p) t . Combining 
this with the above estimates and (59), and since e n — > 0, we obtain for some 
constants K and K' depending on p and t, and all rj > and B > 1: 

(60) limsupP([/ n (£) < m p A(p) t - KB~ l/2 -rj)< K' B~ l/2 . 

n 

Now, for any B > 1 we have aA" > sJBA n for all n large enough be- 
cause w < 1/2. Therefore (60) remains valid if we replace the sets {|A™X| < 
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V^BA^} by {\A?X\ < aA™} in the definition of U n {B). Since A(p) t > 
a.s. and B is arbitrarily large and r/ arbitrarily small, we deduce that C n = 
E [t/A„] | A fX|n { | A n X |< QA w } satisfies 

1 2 



(61) P 



> 



1-P/2 C m p ^(p) t 



m p A(p) i 



0. 



At this stage, the proof of (25) is straightforward. Indeed, since | A™X\ 2p < 
oPAP™\A™X\p when |AfX| <oA ro , one deduces from (21) that 

pm+l—p/2 



A n A{2p,A n ) t < i^A^ 



KAr 



A(p,A n ) t 2 ov, A^O, 



Since pro + 1 — p/2 > 0, the result readily follows from (61). □ 



Proof of (26) under Assumption 2. Barndorff-Nielsen, Shephard 
and Winkel (2006b) proved a similar result under more restrictive conditions. 

Apply (45) with 7] = a/A^ and 6 = A^ 4 , and divide par A n , to get 

(62) E^idAfX'Vv^l 2 Al)<a n 

for a (deterministic) sequence a n going to 0. Next, we apply (48) with < 
r < 2 and x = AfX'/<jA~^ and y = AfX"/^/A^ and we use (44) for g = 2 
and (62) to get 

E^iWA^X/ VAX - |A?X7VA^r|) < + Ae + A r ~ 2 + AV" 2 a n ). 
Hence by taking e = e n = all 4 and A = A n = a n we get as soon as a n < 1: 

(63) A-/ 2 supE?_ 1 (||A?Jrr - |A?XT I) < a' n = i^/ 8 + a^ 2 "^ 8 ) - 0. 

Now, from Barndorff-Nielsen et al. (2006a), we know that the result holds 
when X is continuous: the processes V' (r,q, A n ) defined by (22) with X 
substituted with X' converge in probability, locally uniformly in time, to 
the process A(qr) (this holds even when r > 2). Therefore it is enough to 
prove that 



(64) 



E[sup\V{q,r,A n ) s -V'(q,r, A n ) 

\s<t 



0. 



The left-hand side of (64) at time t is smaller than J2f=f n> ^(Cf )> where 



Si 



A 



l-qr/2 



m ■ 



n i a " + ,-i 



9-1 

■ n i a ^- 



m, 



c(o = A i~ rq/2 n i A r+j-ixniA™ +/ _ 1 A| r - |A™ +i „ 1 A / i r i n i^-i^r, 
j=i j=/+i 
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where an empty product is set to be 1. Taking successive conditional expec- 
tations, and using (44) and (63), we readily obtain that E{Q^(l)) < Ka' n A n . 

Then obviously £i=f n] E(Q (I)) < Kta' n -> for each l = l,...,q, hence 

(64) . □ 

Lemma 1. Under Assumption 2 and if 

i [t/A„] 

(65) D'f := —- \ A ? X \ P E (^) 2 ^D(p) t 
and also, when X is continuous, 

i [t/An] 

(66) A^=— 7^ E 1ST E (SJ) 2 ^m p A(p + 2) t , 

K n A n i= i jel n ,t(i) 

then (27) holds, as well as (28) when X is continuous. 

Proof. In (28) both members are increasing in t and the limit is con- 
tinuous in t, so it is enough to prove the convergence separately for each t. 
To unify the proof of the two results, we write u n = l and D = D{p) if X 

jumps, and u n = A n P//2 and D = m p A(p + 2) if A" is continuous, in which 
case we also set 

[t/A n ] 

D't n :=r^- E E (*;) 2 - 

^ IP 

With this notation, (27) and (28) amount to u n D(p, A n ) t — > D t , and in a 

first step we show that this is equivalent to D t n — > L>£ • 

Apply (47) with p = 2 and x = 5f and y = AfX - 5? and 9 = aA™, and 
use Cauchy-Schwarz and Bienayme-Chebyshev and (44), and also (46) with 

1/2 

V = 7 ln = oA^ 7 and e = r] n ' , to get after some simple calculations that, with 



K = K a : 



J-E(|(C) 2 -(^) 2 ||^ (i _ 1)A J 



A 

(67) 



< f n := K(A n + A- + r(r?y 2 ) + A^ 2 + v/r(r?n /2 )) - 



1/2 

where the final convergence follows from r\ n — > 0, hence T(rj n ) — > as well. 
Observe that u n D(p,A n ) is the same as D' n , with <5" substituted with 0". 

Hence the difference u n D(p, A n ) t — D t n is the sum of less than 2/c n [£/A n ] 
terms (strictly less, because of the border effects at and [£/A n ]), each one 
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being smaller than 1 ^\A r } X\ p \\{e]) 2 - (<5™) 2 |, for some i^j. Then, by 
taking two successive conditional expectations and using (44) and (67), we 
see that the expectation of such a term is smaller than K p A n T n /k n in both 

cases (X continuous or not). Therefore E(\u n D(p, A n ) t — D' t n \) < K p tT n . 
Since T n — ► 0, the claim of this step is complete. 

So far we have proved that (65) implies (27). For proving that (66) implies 

(28) when X is continuous, it remains to show that in this case E(\D' t n — 
D t n \)^0. Exactly as above, D t n - D t n is the sum of less than 2k n [t/A n ] 
terms, each one being smaller than • = ^^-||AfX| p | - |£™| p |(5™) 2 , for 
some i / j. Since ||x + y\ p — \x\ p \ < -ftr p (|x| p_1 |2/| + \y\ p ), and since X = X' 
because X is continuous, we deduce from (44) and by taking two successive 

conditional expectations that j5(£Jj) < K p u n A}/ 2+p ' 2 /k n , hence E{\D' t n — 
D t n \) < KptyJ A n , and we are done. □ 

Proof of (27) under Assumption 2. Step 1. For any p e (0, 1) we set 
ip p (x) = 1 A (2 — \x\/p) + (a continuous function with values in [0, 1], equal to 
1 if |x| < p and to if \x\ > 2p), and we introduce two increasing processes: 

1 [t/An] 

^ = rr E ^(^IW E ( 5 i) 2 > Z(p)? = S'?-l r (p)?. 

J — 1 j£ln,tW 

By the previous lemma we need to prove (65), and for this it is obviously 
enough to show the following three properties, for some suitable processes 
Z{p): 

(68) limlimsupE(F(p)n = 0, 

(69) PG(0,1), n-oo Z(p)?^Z(p) t , 

(70) p^0 Z(p) t -^D t . 

Step 2. By singling out the cases 2\x\ > \y\ and 2\x\ < \y\, we check that 
(recall p < 1): 

^(x + y)\x + # < K p (|x| p + (y 2 A p 2 )). 

Using this with x = 5f and y = AfX — 5f, plus (44) and (46) with i] = p 
and = y/jo, we obtain 

- x -E?„ 1 (^(A^)| A?Jrp) < I* B (p) := i^A^ 2 " 1 + p + T{Jp) + A n p~ l ). 



X 



Now, Y(p)t is the sum of less than 2k n [t/A n ] terms, all smaller than 
|A"X| P |5™| 2 for some i ^ j. By taking two successive conditional expecta- 
tions, as in the previous lemma, and by using (44) and the above, we see 



36 Y. AIT-SAHALIA AND J. JACOD 

that the expectation of such a term is smaller than K p A n T' n (p) /k n . Thus 
E(Y(p)2) < KtT' n (p), and since obviously lim^o limsup n T' n {p) = we ob- 
tain (68). 

Step 3. Now we define the process Z(p). Let us call T q (p) for q = 1, 2, . . . 
the successive jump times of the Poisson process /x([0,t] x {x:j(x) > p/2}), 
and set 

z( P ) t = l A ^( P) i p (i-V'p(Ax Tp(p) ))(4 9(p) _+4 9(p) ). 

q:T q (p)<t 

For all u G ft, q> 1, p' S (0,p) there is q' such that T q (p){uj) = T g /(p')(u;), 
whereas 1 — ijj p increases to the indicator of K\{0}. Thus Z(p) t (uj) ] D(p) t (io), 
and we have (70). 

Step 4. It remains to prove (69). Fix p £ (0, 1) and write T q = T q (p). Recall 
that for u different from all T 9 's, we have |AX U | < p/2. Hence, for each ui 
and each t > 0, we have the following properties for all n large enough: there 
is no T q in (0,k n A n ), nor in (t — (k n + l)A n ,i\; there is at most one T q in 
an interval {{i — l)A n ,iA n ] with iA n < t, and if this is not the case we have 
ip p {A r /X) = 1. Hence for n large enough we have 

z{p) n t = E C 

<T q <t—(k n +l)A„ 

where 

Q = ^\A^ q) Xf(l-^A^ q) X)) y: 

]&T(n,q) 

and i(n,q) = inf(z:iA n > T q ) and l'(n,q) = {j :j / i(n,q), \j-i(n,q)\ < k n }. 

To get (69) it is enough that Q \^X Tq \ p {l - ^ p (AXr g ))(4,- + a T q ) 
for any q. Since \A^ q) X\?{l - ^(A^X)) -+ \AX Tq \P(l - i p (AX Tp )) 
pointwise, so it remains to prove that 

T^r E (^) 2 -4 9 _, E (*?) 2jL -4„ 

where I'_(n,q) and I' + (n,q) are the subsets of I'(n,q) consisting in those j 
smaller, respectively bigger, than i(n,q). We write 

s" = inf cj2, 

y «e[T q -fc n A„,T 9 ) 

= sup a\. 

U£[T q -k n An, Tq) 
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On the one hand, both s™ and S™ converge as n — > oo to Oj< _, because 
/c n A n — > 0. On the other hand, the left-hand side of the first expression in 

(71) is in between the two quantities s™ U™ and SqU™. Moreover the variables 

AfW are i.i.d. N(0, A n ), so U£ is distributed asU' n = ^ J2i=i Vi where the 
ViS are i.i.d. N(0, 1), and by the usual law of large numbers we have U' n — ► 1 

a.s., hence C/™ — ► 1; these facts entail the first part of (71), and the second 
part is proved in the same way. □ 

Proof of (28) under Assumption 2. Step 1. We have to prove that 
when X is continuous, then (66) holds. We have 

[t/A n ] 

A -p/2 £ | < 5T»|P+2JU mp+2 A(p + 2) t 

i=l 

[this property is the preliminary step in Jacod (2008) to prove the last part 

of (11); alternatively, it can be deduced from (11) exactly as D'f — D"™ -^U 
in the end of the proof of Lemma 1] . Therefore it is enough to prove that 

i [*/A„] 
fcn^n i=l jel n ,t{i) 

where Q = m p+2 |5?| p |<^| 2 - to p |<5™|p +2 . 
Let T(n, i) = (i — k n — 1) + A n . When j > i we have (recall <tj is bounded): 
|E?-i(C"OI = K-i(m P+ 2W^ 1)A J(a 2 u _ 1)An - 4_ 1)A J|A«^nA^| 2 
+ \a {l . 1)A J +2 (m p+2 \A^WnA]W\ 2 - m p \A?W\*> +2 ))\ 

<K p A n KtAH^)A n -4-mJ\^ W \ P \)- 
When i — k n < j < i we have 

|e(C™-I^-i)aJI 

= \E(m p+2 \a {j _ 1)An \ 2 (\a {i _ 1)An f - \a {j _ 1)An \ p )\ A?W\*\A]W\ 2 

+ \a u _ l)A S +2 (m p+2 \^WnA]W\ 2 - m p \AfW\^ 2 ) \ T {j . 1)An ) 

<K p A?/ 2 E(\\a {l . 1)A J-\a u _ 1)A ^^ 

Therefore, since \\(T (i _ 1)A J q - |o- (i _ 1)A J«| <K q \a {i _ 1)An -a {j _ l)An \ for any 
q > I, we deduce from (44) and Cauchy-Schwarz and the two estimates 
above that 

(72) jel n , t (i) \nC, 3 \r T(nA )\<K p A^ 2 . 
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Moreover, as a trivial consequence of (44) again, we get 

(73) jGl n , t (i) => E(|C^| 2 |^(n, 4 ))<^ P A^. 
Now we set 

, [*/A„] 
^ = —2 E = E E W I ^(n,i)), ^ = *f " Z n . 

k n /S. n jg/ nt i=i 

On the one hand (72) yields |Z n | < K p t\J A n — ► 0. On the other hand Z' n = 

Ei=i (Vi-HVi I ^(n,*)))) whereas r?" is ^T(n,i+2fe„+i) -measurable. Hen- 
ce (73) gives 

E(Z'l) = Yl E (W " E « I ^T(n,i)))fa? " E($ | ^ T (n/)))) 

»,»' : l<i,i'<[t/A„] 

: l<i<[t/A n ],|i-i'|<2fc n +l 

x(^-E(^|^ T(fl|i0 ))) 
< ]T E (l«l) < (4A; n + 3)[t/A n ,]A p A2 

M' : l<i<[t/ A„],\i-i'\<2k n +l 
<K p t(k n A n ), 

p 

which goes to by virtue of (23). It then follows that Y™ — > 0, and we are 
done. □ 

8.5. Proof of Theorem 6. 

Proof, (a) When X is continuous the variables U n = (V^ jt ) _1 / 2 (S(p, k, 

A n )j — k p l 2 ~ l ) converge stably in law to N(0, 1) by Theorem 5(b), for both 
choices of V£ t . In particular if c c nt is given by (35) we have that F(C^ t ) = 
P(U n < —z a ) — ► a, and we even have more, because of the stable convergence 
in law; namely, for any measurable subset B of f2, then 

(74) nC c n ,t n B) = F({U n < -z a } HB)^ aF(B). 

Now, this is not quite enough to prove (a), since it may happen that X 
is not continuous but the observed path is continuous on [0,t]. To deal with 
this case, for any integer iV we introduce the process 

xl N) =X + I b s ds+ I a s dW s - I' S'A mn<m ds. 



Put an additional exponent ^ for the variables defined on the basis of X^ , 
writing, for example, V^t or <^(p> ^> ^n)t or ^nt ■ Then (74) applied 
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with the continuous process 1^ shows that P(C£ ( f ) D B) ^ aP(B) for 

any B £ J 7 . However, on the set £l[ N ^ where X s = for all s 6 [0,i] we 

have y n c f } = V° t and 5(p, fe, A n )[ N) = S(p,k,A n ) t , hence n = 

n n ( t N) . Therefore P(C* t n 5#°) -» aP(0^ } ). Since fi^ increases to 
fi£ by virtue of (e) of Assumption 1, we deduce (i). 

(b) Finally assume P(fif') > 0. Then Theorem 1 yields that S(p, k, A n ) t 
1 on the set On the other hand if we use the version (30) for V£ t we 

deduce from (25) that cjj t — — > hP/ 2 ^ 1 > 1, whereas if we use the version (31) 

we have V^ t /A n M(p, fc)A(2p) t /A(p)f , hence again c£ t k p l 2 ~ l > 1; 
so the result is obvious. □ 



8.6. Proof of Theorem 7. 



Proof. Relative to the proof of the previous theorem, we need a few 
changes. First we replace (a) of Theorem 6 by two statements (a) and (b) 
here: the case (b) corresponds to the situation where the limit in Theorem 
5(a) is normal, and so this is similar to Theorem 6(a). The case (a) here cor- 
responds to a nonnormal limit with variance 1, and for this limit we cannot 
evaluate exactly the quantiles and we rely upon the Chebyshev inequality; 
this is why we only get a bound on the level but not the exact value. Apart 
from these changes, the proof for the level is the same. 

For (c) we suppose that P(fi£) > 0. Letting T = inf (s : AX S ^ 0), we have 
T > t on Q t , whereas fi({(s, z) : s < T,5(s, z) ^ 0}) < 1, hence K(u({(s, z) :s < 
T,5(s,z) 0})) < 1, hence a fortiori the predictable process Y s = Jq J E \ko 
S(r, z)\u(dr,dz) is finite- valued on [0, T], so there is an increasing sequence 
(T q ) of stopping times with T q <T and Y Tq < q and F({T q <t}n fi£ )t ) -► 

P(ft£ if ), and it is clearly enough to show that P(C^ | {T q < t} n O^J -> 1 
for all q having F({T q <t}f] Q,^ t ) > 0. 
Now with q fixed we consider the process 

rs rs rsAT q r 

X s = Xq + / b r dr+ / a r dW r — / / k o <5(r, z)u(dr, dz). 
Jo Jo Jo Je 

This process satisfies Assumption 1 and is continuous, and it coincides with 
X on the interval [0,T q ). Then similarly to the previous proof, we see that 

on the set {T q <t}f] n£ >t we have S(p, k, A n ) t h v l 2 ' x and [use 

(23) and (28) for the latter], hence the result. □ 
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